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(ill) NUMBER OF SEQUENCES : 16 

(iv) CORRESPONDENCE ADDRESS s 
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(C) CLASSIFICATIONS 
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(C) REFERENCE /DOCKET NUMBER: 04020/090001 

(ix) TELECOMMUNICATION INFORMATION s 

(A) TELEPHONES 617/542-5070 
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(C) TELEX: 200154 



(2) INFORMATION FOR SEQ ID NOsl: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2030 base pairs 

(B) TYPBs nucleic acid ^ 

(C) STRANDSDNESSs single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NOsl: 

TGGCTGGCAA TGGCCTTGCT GACCTCGAGC CGGGCCCACG TGGGGACCTT TGGAGCACAG 60 

CCTACGATCC TGGTGCAAGG CCGGTGGATG CAGAGGCCAG TCCATATACC ACCCAGGCCT 120 

GCGAGGAGCG TGGTCCCCAC CCATCCAGCC CATATGTGCA AGTGCCCTTG ACAGAGAGGC 180 
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TGGTCATATC 


CATGGTGACC 


ATTTATGGGC 


CAGAACAGGT 


CCCCATCTGC 


GCAGTGAACC 


240 


CTGTGCTGAG 


CACCTTGCAG 


ACGTGATCTT 


GCTTCGTCCT 


GCAGGACTGT 


GCGGGGGAGG 


300 


AAAATCCAAG 


AGGAAGAAGG 


ATCTACGGAT 


ATCCTGCATG 


TCCAAGCCAC 


CCGCACCCAA 


360 


CCCCACACCC 


CCCCGGAACC 


TGGACTCCCG 


GACCTTCATC 


ACCATTGGAG 


ACAGAAACTT 


420 


TGAGGTGCAG 


GCTGATGACT 


TGGTGACCAT 


CTCAGAACTG 


GGCCGTGGAG 


CCTATGGGGT 


480 


GGTAGAGAAG 


GTGCGGCACG 


CCCAGAGCGG 


GACCATCATG 


GCCGTGAAGC 


GGATCCGGGC 


540 


CACCGTGAAC 


TCACAGGAGC 


AGAAGCGGCT 


GCTCATGGAC 


CTGGACATCA 


ACATGCGCAC 


600 


GGTCGACTGT 


TTCTACACTG 


TCACCTTCTA 


CGGGGCACTA 


TTCAGAGAGG 


GAGACGTGTG 


660 


GATCTGCATG 


GAGCTCATGG 


ACACATCCTT 


GGACAAGTTC 


TACCGGAAGG 


TGCTGGATAA 


720 


AAACATGACA 


ATTCCAGAGG 


ACATCCTTGG 


GGAGATTGCT 


GTGTCTATCG 


TGCGGGCCCT 


780 


GGAGCATCTG 


CACAGCAAGC 


TGTCGGTGAT 


CCACAGAGAT 


GTGAAGCCCT 


CCAATGTCCT 


840 


TATCAACAAG 


GAGGGCCATG 


TGAAGATGTG 


TGACTTTGGC 


ATCAGTGGCT 


ACTTGGTGGA 


900 


CTCTGTGGCC 


AAGACGATGG 


ATCCCGGCTG 


CAAGCCCTAC 


ATGGCCCCTG 


AGAGGA7CAA 


960 


CCCAGAGCTG 


AACCAGAAGG 


GCTACAATGT 


CAAGTCCGAC 


GTCTGGAGCC 


TGGGCATCAC 


1020 


CATGATTGAG 


ATGGCCATCC 


TGCGGTTCCC 


TTACGAGTCC 


TGGGGGACCC 


CGTTCCAGCA 


1080 


GCTGAAGCAG 


GTGGTGGAGG 


AGCCGTCCCC 


CCAGCTCCCA 


GCCGACCGTT 


TCTCCCCCGA 


1140 


GTTTGTGGAC 


TTCACTGCTC 


AGTGCCTGAG 


GAAGAACCCC 


GCAGAGCGTA 


TGAGCTACCT 


1200 


GGAGCTGATG 


GAGCACCCCT 


TCTTCACCTT 


GCAGAAAACC 


AAGAAGACGG 


ACATTGCTGC 


1260 


CTTCGTGAAG 


AAGATCCTGG 


GAGAAGACTC 


ATAGGGGCTG 


GGCCTCGGAC 


CCCACTCCGG 


1320 


CCCTCCAGAG 


CCCCACAGCC 


CCATCTGCGG 


GGGCAGTGCT 


CACCCACACC 


ATAAGCTACT 


1380 


GCCATCCTGG 


CCCAGGGCAT 


CTGGGAGGAA 


CCGAGGGGGC 


TGCTCCCACC 


TGGCTCTGTG 


1440 


GCGAGCCATT 


TGTCCCAAGT 


GCCAAAGAAG 


CAGACCATTG 


GGGCTCCCAG 


CCAGGCCCTT 


1500 


GTCGGCCCCA 


CCAGTGCCTC 


TCCCTGCTGC 


TCCTAGGACC 


CGTCTCCAGC 


TGCTGAGATC 


1560 


CTGGACTGAG 


GGGGCCTGGA 


TGCCCCCTGT 


GGATGCTGCT 


GCCCCTGCAC 


AGCAGGCTGC 


1620 


CAGTGCCTGG 


GXGGATGGGC 


CACCGCCTTG 


CCCAGCCTGG 


ATGCCATCCA 


AGTTGTATAT 


1680 


TTTTTTAATC 


TCTCGACTGA 


ATGGACTTTG 


CACACTTTGG 


CCCAGGGTGG 


CCACACCTCT 


174^ 


ATCCCGGCTT 


TGGTGCGGGG 


TACACAAGAG 


GGGATGAGTT 


GTGTGAATAC 


CCCAAGACTC 


1800 


CCATGAGGGA 


GATCCCATGA 


GCCGCCCAAG 


GCCTTCCCCT 


GGCACTGGCA 


AACAGGGCCT 


1860 


CTGCGGAGCA 


CACTGGCTCA 


CCCAGTCCTG 


CCCGCCACCG 


TTATCGGTGT 


CATTCACCTT 


1920 


TCGTGTTTTT 


TTTAATTTAT 


CCTCTGTTGA 




TGCTTTATGG 


GTTTGGCTTG 


1980 


TTTTTCTTGC 


ATGGTTTGGA 


GCTGATCGCT 


TCTCCCCCAC 


CCCCTAGGGG 




2030 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) .TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Lye Pro Pro Ala Pro Aan Pro Thr Pro Pro Arg Asn Leu Asp 
1 5 10 15 

Ser Arg Thr Phe lie Thr lie Gly Asp Arg Met Phe Glu Val Glu Ala 
20 25 30 

Asp Asp Leu Val Thr lie Ser Glu Leu Gly Arg Gly Ala Tyr Gly Val 
35 40 45 

Val Glu Lys Val Arg His Ala Gin Ser Gly Thr lie Met Ala Val Lys 
50 55 60 

Arg lie Arg Ala Thr Val Asn Ser Gin Glu Gin Lys Arg Leu Leu Met 

65 70 75 80 

Asp Leu Asp lie Asn Met Arg Thr Val Asp Cys Phe Tyr Thr Val Thr 



Leu Met Asp Thr Ser Leu Asp Lys Phe Tyr Arg Lys Val Leu Asp Lys 
115 120 125 

Asn Met Thr He Pro Glu Asp He Leu Gly Glu He Ala Val Ser He 
130 135 140 

Val Arg Ala Leu Glu His Leu His Ser Lys Leu Ser Val He His Arg 
145 150 155 160 

Asp Val Lys Pro Ser Asn Val Leu He Asn Lys Glu Gly His Val Lys 
165 170 175 

Met Cys Asp Phe Gly He Ser Gly Tyr Leu Val Asp Ser Val Ala Lys 
180 185 190 

Thr Met Asp Ala Gly Cys Lys Pro Tyr Met Ala Pro Glu Arg He Asn 
195 200 205 

Pro Glu Leu Asn Gin Lys Gly Tyr Asn Val Lys Ser Asp Val Trp Ser 
210 215 220 

Leu Gly He Thr Met He Glu Met Ala He Leu Arg Phe Pro Tyr Glu 
225 230 235 240 

Ser Trp Gly Thr Pro Phe Gin Gin Leu Lys Gin Val Val Glu Glu Pro 
245 250 255 

Ser Pro Gin Leu Pro Ala Asp Arg Phe Ser Pro Glu Phe Val Asp Phe 
260 265 270 

Thr Ala Gin Cys Leu Arg Lys Asn Pro Ala Glu Arg Met Ser Tyr Leu 



85 



90 



95 



Phe Tyr Gly Ala Leu Phe Arg Glu Gly Asp Val Trp He Cys Met Glu 
100 105 110 
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275 280 285 

Glu Leu Met Glu His Pro Phe Phe Thr Leu His Lya Thr Lye Lys Thr 
290 295 300 

Aep lie Ala Ala Phe Val Lys Lya lie Leu Gly Glu Asp Ser 
305 . 310 315 

(2) INFORMATION FOR SBQ ID NOt3: 

(i) SEQUENCE CHARACTERISTICS l 

(A) LENGTH i 1602 base pairs 

(B) TYPBs nucleic acid 

(C) STRANDBDNBSS x single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION t SEQ ID NO: 3: 



TAGCTGCAGC 


ACAGCCTTCC 


CTAACGTTGC 


AACTGGGGGA 


AAAATCACTT 


TCCAGTCTGT 


60 


TTTGCAAGGT 


GTGCATTTCC 


ATCTTGATTC 


CCTGAAAGTC 


CATCTGCTGC 


ATCGGTCAAG 


120 


AGAAACTCCA 


CTTGCATGAA 


GATTGCACGC 


CTGCAGCTTG 


CATCTTTGTT 


GCAAAACTAG 


180 


CTACAGAAGA 


GAAGCAAGGC 


AAAGTCTTTT 


GTGCTCCCCT 


CCCCCATCAA 


AGGAAAGGGG 


240 


AAAATGTCTC AGTCQAAAGG 


CAAGAAGCGA 


AACCCTGGCC 


TTAAAATTCC 


AAAAGAAGCA 


300 


TTTGAACAAC 


CTCAGACCAG 


TTCCACACCA 


CCTAGAGATT 


TAGACTCCAA 


GGCTTGCATT 


360 


TCTATTGGAA 


ATCAGAACTT 


TGAGGTGAAG 


GCAGATGACC 


TGGAGCCTAT 


AATGGAACTG 


420 


GGACGAGGTG 


CGTACGGGGT 


GGTGGAGAAG 


ATGCGGCACG 


TGCCCAGCGG 


GCAGATCATG 


480 


GCAGTGAAGC 


GGATCCGAGC 


CACAGTAAAT 


AGCCAGGAAC 


AGAAACGGCT 


ACTGATGGAT 


540 


TTGGATATTT 


CCATGAGGAC 


GGTGGACTGT 


CCATTCACTG 


TCACCTTTTA 


TGGCGCACTG 


600 


TTTCGGGAGG 


GTGATGTGTG 


GATCTGCATG 


GAGCTCATGG 


ATACATCACT 


AGATAAATTC 


660 


TACAAACAAG 


TTATTGATAA 


AGGCCAGACA 


ATTCCAGAGG 


ACATCTTAGG 


GAAAATAGCA 


720 


GTTTCTATTG 


TAAAAGCATT 


AGAACATTTA 


CATAGTAAGC 


TGTCTGTCAT 


TCACAGAGAC 


780 


GTCAAGCCTT 


CTAATGTACT 


CATCAATGCT 


CTCGGTCAAG 


TGAAGATGTG 


CGATTTTGGA 


840 


ATCAGTGGCT 


ACTTGGTGGA 


CTCTGTTGCT 


AAAACAATTG 


ATGCAGGTTG 


CAAACCATAC 


900 


ATGGCCCCTG 


AAAGAATAAA 


CCCAGAGCTC 


AACCAGAAGG 


GATACAGTGT 


GAAGTCTGAC 


960 


ATTTGGAGTC 


TGGGCATCAC 


GATGATTGAG 


TTGGCCATCC 


TTCGATTTCC 


CTATGATTCA 


102<*^ 


TGGGGAACTC 


CATTTCAGCA 


GCTCAAACAG 


GTGGTAGAGG 


AGCCATCGCC 


ACAACTCCCA 


1080 


GCAGACAAGT 


TCTCTGCAGA 


GTTTGTTGAC 


TTTACCTCAC 


AGTGCTTAAA 


GAAGAATTCC 


1140 


AAAGAACGGC 


CTACATACCC 


AGAGCTAATG 


CAACATCCAT 


TTTTCACCCT 


ACATGAATCC 


1200 


AAAGGAACAG 


ATGTGGCATC 


TTTTGTAAAA 


CTGATTCTTG 


GAGACTAAAA 


AGCAGTGGAC 


1260 


TTAATCGGTT 


GACCCTACTG 


TGGATTGGTG 


GGTTTCGGGG 


TGAAGCAAGT 


TCACTACAGC 


1320 
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ATCAATAGAA AGTCATCTTT GAGATAATTT AACCCTGCCT CTCAGAGGGT TTTCTCTCCC 1380 

AATTTTCTTT TTACTCCCCC TCTTAAGGGG GCCTTGGAAT CTATAGTATA GAATGAACTG 1440 

TCTAGATGGA TGAATTATGA TAAAGGCTTA GGACTTCAAA AGGTGATTAA ATATTTAATG 1500 

ATGTGTCATA TGAGTCCTCA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1560 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 1602 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTHS 334 amino acids 

(B) TYPES amino acid 

(C) STRANDBDNBSSi not relevant 
(0) TOPOLOGY* linear 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NOs4s 

Met Ser Gin Ser Lys Gly Lye Lye Arg Aan Pro Gly Leu Lye lie Pro 

1 5 10 15 

Lye Glu Ala Phe Glu Gin Pro Gin Thr Ser Ser Thr Pro Pro Arg Asp 
20 25 30 

Leu Asp Ser^.Lys Ala Cys lie Ser lie Gly Asn Gin Asn Phe Glu Val 
35 40 45 

Lye Ala Asp Asp Leu Glu Pro He Met Glu Leu Gly Arg Gly Ala Tyr 
50 55 60 

Gly Val Val Glu Lys Met Arg His Val Pro Ser Gly Gin He Met Ala 
65 70 75 80 

Val Lye Arg He Arg Ala Thr Val Asn Ser Gin Glu Gin Lys Arg Leu 
85 90 95 

Leu Met Asp Leu Asp He Ser Met Arg Thr Val Asp Cys Pro Phe Thr 
100 105 110 

Val Thr Phe Tyr Gly Ala Leu Phe Arg Glu Gly Asp Val Trp He Cys 
115 120 125 

Met Glu Leu Met Asp Thr Ser Leu Asp Lys Phe Tyr Lys Gin Val He 
130 135 140 

Asp Lye Gly Gin Thr He Pro Glu Asp He Leu Gly Lys He Ala Val 
145 150 155 160 

Ser He Val Lys Ala Leu Glu His Leu His Ser Lys Leu Ser Val He 
165 170 175 

His Arg Asp Val Lys Pro Ser Asn Val Leu He Asn Ala Leu Gly Gin 
ISO 1SS 190 

Val Lys Met Cys Asp Phe Gly He Ser Gly Tyr Leu Val Asp Ser Val 
195 200 205 

Ala Lys Thr He Asp Ala Gly Cys Lys Pro Tyr Met Ala Pro Glu Arg 
210 215 220 
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lie Asn Pro Glu Leu Asn Gin Lys Gly Tyr Ser Val Lys Ser Asp lie 
225 230 235 240 

Trp Ser Lou Gly lie Thr Met lie Glu Leu Ala lie Leu Arg Phe Pro 
245 250 255 

Tyr Asp sar Trp Gly Thr Pro Phe Gin Gin Leu Lys Gin Val Val Glu 
260 265 270 

Glu Pro Ser Pro Gin Leu Pro Ala Asp Lys Phe ser Ala Glu Phe Val 
275 280 285 

Asp Phe Thr Ser Gin Cys Leu Lys Lys Asn Ser Lys Glu Arg Pro Thr 
290 295 300 

Tyr Pro Glu Leu Met Gin His Pro Phe Phe Thr Leu His Glu Ser Lys 
305 310 315 320 

Gly Thr Asp Val Ala Ser Phe Val Lys Leu lie Leu Gly Asp 

325 330 

(2) INFORMATION FOR SEQ ID NOiSt 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3497 base pairs 

(B) TYPE: nucleic acid 

(C) SXRANDXDNSSSt single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



CTAGGGTCCC 


CGGCGCCAGG 


CCACCCGGCC 


GTCAGCAGCA 


TGCAGGGTAA 


ACGCAAAGCA 


60 


CTGAAGTTGA 


ATTTTGCAAA 


TCCACCTTTC 


AAATCTACAG 


CAAGGTTTAC 


TCTGAATCCC 


120 


AATCCTACAG 


GAGTTCAAAA 


CCCACACATA 


GAGAGACTGA 


GAACACACAG 


CATTGAGTCA 


180 


TCAGGAAAAC 


TGAAGATCTC 


CCCTGAACAA 


CACTGGGATT 


TCACTGCAGA 


GGACTTGAAA 


240 


GACCTTGGAG 


AAATTGGACG 


AGGAGCTTAT 


GGTTCTGTCA 


ACAAAATGGT 


CCACAAACCA 


300 


AGTGGGCAAA 


TAATGGCAGT 


TAAAAGAATT 


CGGTCAACAG 


TGGATGAAAA 


AGAACAAAAA 


360 


CAACTTCTTA 


TGGATTTGGA 


TGTAGT AATG 


CGGAGTAGTG 


ATTGCCCATA 


CATTGTTCAG 


420 


TTTTATGGTG 


CACTCTTCAG 


AGAGGGTGAC 


TGTTGGATCT 


GTATGGAACT 


CATGTCTACC 


480 


TCGTTTGATA 


AGTTTTACAA 


ATATGTATAT 


AGTGTATTAG 


ATGATGTTAT 


TCCAGAAGAA 


540 


ATTTTAGGCA 


AAATCACTTT 


AGCAACTGTG 


AAAGCACTAA 


ACCACTTAAA 


AGAAAACTTG 




AAAATTATTC 


ACAGAGATAT 


CAAACCTTCC 


AATATTCTTC 


TGGACAGAAG 


TGGAAATATT 


660 


AAGCTCTGTG 


ACTTCGGCAT 


CAGTGGACAG 


CTTGTGGACT 


CTATTGCCAA 


GACAAGAGAT 


720 


GCTGGCTGTA 


GGCCATACAT 


GGCACCTGAA 


AGAATAGACC 


CAAGCGCATC 


ACGACAAGGA 


780 


TATGATGTCC 


GCTCTGATGT 


CTGGAGTTTG 


GGGATCACAT 


TGTATGAGTT 


GGCCACAGGC 


340 


CGATTTCCTT 


ATCCAAAGTG 


GAATAGTGTA 


TTTGATCAAC 


TAACACAAGT 


CGTGAAAGGA 


900 


GATCCTCCGC 


AGCTGAGTAA 


TTCTGAGGAA 


AGGGAATTCT 


CCCCGAGTTT 


CATCAACTTT 


960 
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GTCAACTTGT 


GCCTTACGAA 


GGATGAATCC 


AAAAGGCCAA 


AGTATAAAGA 


GCTTCTGAAA 


1020 


CATCCCTTTA 


TTTTGATGTA 


TGAAGAACGT 


GCCGTTGAGG 


TCGCATGCTA 


TGTTTGTAAA 


1080 


ATCCTGGATC 


AAATGCCAGC 


TACTCCCAGC 


TCTCCGATGT 


ATGTCGATTG 


ATATCGTGCT 


1140 


ACATCAGACT 


CTAGAAAAAA 


GGGCTGAGAG 


GAAGCAAGAC 


GTAAAGAATT 


TTCATCCCGT 


1200 


ATCACAGTGT 


TTTTATTGCT 


CGCCCAGACA 


CCATGTGCAA 


TAAGATTGGT 


GTTCGTTTCC 


1260 


ATCATGTCTG 


TATACTCCTG 


TCACCTAGAA 


CGTGCATCCT 


TGTAATACCT 


GATTGATCAC 


1320 


AGA6TGTTAG 


TGCTGGTCAG 


AGAGACCTCA 


TCCTGCTCTT 


TTGTGATGAA 


CATATTCATG 


1380 


AAAT6TGGAA 


GTCAGTACGA 


TCAAGTTGTT 


GACTGTGATT 


AGATCACATC 


TTAAATTCAT 


1440 


TTCTAGACTC 


AAAACCTGGA 


GATGCAGCTA 


CTGGAATGGT 


GTTTTGTCAG 


ACTTCCAAAT 


1500 


CCTGGAAGGA 


CACAGTGATG 


AATGTACTAT 


ATCTGAACAT 


AGAAACTCGG 


GCTTGAGTGA 


1560 


GAA6AGCTTG 


CAGAGCCAAC 


GAGACACATT 


GCCTTCTGGA 


GCTGGGAGAC 


AAAGGAGGAA 


1620 


TTTACTTTCT 


TCACCAAGTG 


CAATAGATTA 


CTGATGTGAT 


ATTCTGTTGC 


TTTACAGTTA 


1680 


CAGTTGATGT 


TTGGGGATCG 


ATGTGCTCAG 


CCAAATTTCC 


TGTTTGAAAT 


ATCATCTTAA 


1740 


A7TAGAATGA ATTT^TCTTT 


ACCAAAAACC 


ATGTTGCGTT 


CAAAGAGGTG 


AACATTAAAA 


1800 


TATAGAGACA 


GGACAGAATG 


TGTTCTTTTC 


TCCTCTACCA 


GTCCTATTTT 


TCAAXGGGAA 


1860 


GACTCAGGAG 


TCTGCCACTT 


GTCAAAGAAG 


GTGCTGAXCC 


TAAGAATTTT 


TCATTCTCAG 


1920 


AATTCGGTGT 


GCTGCCAACT 


TGATGTTCCA 


CCTGCCACAA 


ACCACCAGGA 


CTGAAAGAAG 


1980 


AAAACAGTAC 


AGAAGGCAAA 


GTTTACAGAT 


GTTTTTAATT 


CTAGTATTTT 


ATCTGGAACA 


2040 


ACTTGTAGCA 


GCTATATATT 


TCCCCTTGGT 


CCCAAGCCTG 


ATACTTTAGC 


CATCATAACT 


2100 


CACTAACAGG 


GAGAAGTAGC 


TAGTAGCAAT 


GTGCCTTGAT 


TGATTAGATA 


AAGATTTCTA 


2160 


GTAGGCAGCA 


AAAGACGAAA 


TCTCAGTTGT 


TTGCTTCTTG 


CCATCACTGG 


TCCAGGTCTT 


2220 


CAGTTTCCGA 


ATCTCTTTCC 


CTTCCCCTGT 


GGXCTATTGT 


CGCTATGTGA 


CTTGCGCTTA 


2280 


ATCCAATATT 


TTGCCTTTTT 


TCTATATCAA 


AAAACCTTTA 


CAGTTAGCAG 


GGATGTTCCT 


2340 


TACCGAGGAT 


TTTTAACCCC 


CAATCTCTCA 


TAATCGCTAG 


TGTTTAAAAG 


GCTAAGAATA 


2400 


GTGGGGCCGA 


ACCGATGTGG 


TAGGTGATAA 


AGAGGCATCT 


TTTCTAGAGA 


CACATTGGAC 


2460 


CAGATGAGGA 


TCCGAAACGG 


CAGCCTTTAC 


GTTCATCACC 


TGCTAGAACC 


TCTCGTAGTC 


2520^ 


CATCACCATT 


TCTTGGCATT 


GGAATTCTAC 


TGGAAAAAAA 


TACAAAAAGC 


AAAACAAAAC 


2580 


CCTCAGCACT 


GTTACAAGAG 


GCCATTTAAG 


TATCTTGTGC 


TTCTTCACTT 


ACCCATTAGC 


2640 


CAGGTTCTCA 


TTAGGTTTTG 


CTTGGGCCTC 


CCTGGCACTG 


AACCTT AGGC 


TTTGTATGAC 


2700 


AGTGAAGCAG 


CACTGTGAGT 


GGTTCAAGCA 


CACTGGAATA 


TAAAACAGTC 


ATGGCCTGAG 


2760 


ATGCAGGTGA 


TGCCATTACA 


GAACCAAATC 


GTGGCACGTA 


TTGCTGTGTC 


TCCTCTCAGA 


2820 
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GTGACAGTCA 


TAAATACTGT 


CAAACAATAA 


AGGGAGAATG 


GTGCTGTTTA 


AAGTCACATC 


2880 


CCTGTAAATT 


GCAGAATTCA 


AAAGTGATTA 


TCTCTTTGAT 


CTACTTGCCT 


CATTTCCCTA 


2940 


TCTTCTCCCC 


CACGGTATCC 


TAAACTTTAG 


ACTTCCCACT 


GTTCTGAAAG 


GAGACATTGC 


3000 


TCTATGTCTG 


CCTTCGACCA 


CAGCAAGCCA 


TCATCCTCCA 


TTGCTCCCGG 


GGACTCAAGA 


3060 


GGAATCTGTT 


TCTCTGCTGT 


CAACTTCCCA 


TCTGGCTCAG 


CATAGGGTCA 


CTTTGCCATT 


3120 


ATGCAAATGG 


AGATAAAAGC 


AATTCTGGCT 


GTCCAGGAGC 


TAATCTGACC 


GTTCTATTGT 


3180 


GTGGATGACC 


ACATAAGAAG 


GCAATTTTAG 


TGTATTAATC 


ATAGATTATT 


ATAAACTATA 


3240 


AACTTAAGGG 


CAAGGAGTTT 


ATTACAATGT 


ATCTTTATTA 


AAACAAAAGG 


GTGTATAGTG 


3300 


TTCACAAACT 


GTGAAAATAG 


TGTAAGAACT 


GTACATTGTG 


AGCTCTGGTT 


ATTTTTCTCT 


3360 


TGTACCATAG 


AAAAATGTAT 


AAAAATTATC 


AAAAAGCTAA 


TGTGCAGGGA 


TATTGCCTTA 


3420 


TTTGTCTGTA 


AAAAATGGAG 


CTCAGTAACA 


TAACTGCTTC 


TTGGAGCTTT 


GGAATATTTT 


3480 


ATCCTGTATT 


CTTGTTT 










3497 


(2) I NFORMAT ION FOR SBQ ID NOt6* 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH* 363 amino acids 

(B) TYPE t amino acid 

(C) STRANDEDNESSt not relevant 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Gin Gly Lys Arg Lya Ala Leu Lye Leu Asn Phe Ala Asn Pro Pro 
1 5 10 15 

Phe Lya Ser Thr Ala Arg Phe Thr Leu Asn Pro Asn Pro Thr Gly Val 
20 25 30 

Gin Asn Pro His lie Glu Arg Leu Arg Thr His Ser lie Glu Ser Ser 
35 40 45 

Gly Lys Leu Lys lie Ser Pro Glu Gin His Trp Asp Phe Thr Ala Glu 
50 55 60 

Asp Leu Lys Asp Leu Gly Glu lie Gly Arg Gly Ala Tyr Gly Ser Val 
65 70 75 80 

Asn Lys Met Val His Lys Pro Ser Gly Gin lie Met Ala Val Lys Arg 
85 90 95 

lie Arg Ser Thr Val Asp Glu Lys Glu Gin Lys Gin Leu Leu Met Asp 
100 105 110 

Leu Asp Val Val Met Arg Ser Ser Asp Cye Pro Tyr lie Val Gin Phe 
115 120 125 

Tyr Gly Ala Leu Phe Arg Glu Gly Asp Cys Trp lie Cys Met Glu Leu 
130 135 140 

Met Ser Thr Ser Phe Asp Lys Phe Tyr Lys Tyr Val Tyr Ser Val Leu 
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145 150 155 160 

Asp Asp Val lie Pro Glu Glu lie Leu Gly Lys lie Thr Leu Ala Thr 
165 170 175 

Val Lye Ala Leu Asn His Leu Lys Glu Asn Leu Lys lie lie His Arg 
180 185 190 

Aap lie Lys Pro Ser Asn lie Leu Leu Asp Arg Ser Gly Asn He Lys 
195 200 205 

Leu Cys Asp Phe Gly He Ser Gly Gin Leu Val Asp Ser He Ala Lys 
210 215 220 

Thr Arg Asp Ala Gly Cys Arg Pro Tyr Met Ala Pro Glu Arg lie Asp 
225 230 235 240 

Pro Ser Ala Ser Arg Gin Gly Tyr Asp Val Arg Ser Aap Val Trp Ser 
245 250 255 

Leu Gly He Thr Leu Tyr Glu Leu Ala Thr Gly Arg Phe Pro Tyr Pro 
260 265 270 

Lys Trp Asn Ser Val Phe Asp Gin Leu Thr Gin Val Val Lys Gly Asp 
275 280 285 

Pro Pro GlQ>.Leu Ser Asn Ser Glu Glu Arg Glu Phe Ser Pro Ser Phe 
290 295 300 

He Asn Phe Val Asn Leu Cys Leu Thr Lys Asp Glu Ser Lys Arg Pro 
305 310 315 320 

Lvs Tyr Lys Glu Leu Leu Lys His Pro Phe He Leu Met Tyr Glu Glu 
325 330 335 

Arg Ala Val Glu Val Ala Cys Tyr Val Cys Lys He Leu Asp Gin Met 
340 345 350 

Pro Ala Thr Pro Ser Ser Pro Met Tyr Val Asp 
355 360 

(2) INFORMATION FOR SEQ ID NO* 7 J 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH) 3553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CAACAATGGC GGCTCCGAGC CCGAGCGGTG GCGGCGGCAG CGGCACCCCC GGCCCCGTAG 60 '\ 

GGTCCCCGGC GCCAGGCCAC CCGGCCGTCA GCAGCATGCA GGGTAAACGC AAAGCACTGA 120 

AGTTGAATTT TGCAAATCCA CCTTTCAAAT CTACAGCAAG GTTTACTCTG AATCCCAATC 180 

CTACAGGAGT TCAAAACCCA CACATAGAGA GACTGAGAAC ACACAGCATT GAGTCATCAG 240 

GAAAACTGAA GATCTCCCCT GAACAACACT GGGATTTCAC TGCAGAGGAC TTGAAAGACC 300 

TTGGAGAAAT TGGACGAGGA GCTTATGGTT CTGTCAACAA AATGGTCCAC AAACCAAGTG 360 
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GGCAAATAAT 


GGCAGTTAAA 


AGAATTCGGT 


CAACAGTGGA 


TGAAAAAGAA 


CAAAAACAAC 


420 


TTCTTATGCA 


TTTGGATGTA 


GTAATGCGGA 


GTAGTGATTG 


CCCATACATT 


GTTCAGTTTT 


480 


ATGGTGCACT 


CTTCAGAGAG 


GGTGACTGTT 


GGATCTGTAT 


GGAACTCATG 


TCTACCTCGT 


540 


TTGATAAGTT 


TTACAAATAT 


GTATATAGTG 


TATTAGATGA 


TGTTATTCCA 


GAAGAAATTT 


600 


TAGGCAAAAT 


CACTTTAGCA 


ACTGTGAAAG 


CACTAAACCA 


CTTAAAAGAA 


AACTTGAAAA 


660 


TTATTCACAG 


AGATATCAAA 


CCTTCCAATA 


TTCTTCTGGA 


CAGAAGTGGA 


AATATTAAGC 


720 


TCTGTGACTT 


CGGCATCAGT 


GGACAGCTTG 


TGGACTCTAT 


TGCCAAGACA 


AGAGATGCTG 


780 


GCTGTAGGCC 


ATACATGGCA 


CCTGAAAGAA 


TAGACCCAAG 


CGCATCACGA 


CAAGGATATG 


840 


ATGTCCGCTC 


TGATGTCTGG 


AGTTTGGGGA 


TCACATTGTA 


TGAGTTGGCC 


ACAGGCCGAT 


900 


TTCCTTATCC 


AAAGTGGAAT 


AGTGTATTTG 


ATCAACTAAC 


ACAAGTCGTG 


AAAGGAGATC 


960 


CTCCGCA6CT 


GAGTAATTCT 


GAGGAAAGGG 


AATTCTCCCC 


GAGTTTCATC 


AACTTTGTCA 


1020 


ACTTGTGCCT 


TACGAAGGAX 


GAATCCAAAA 


GGCCAAAGTA 


TAAAGAGCTT 


CTGAAACATC 


1080 


CCTTTATTTT 


GATGTATGAA 


GAACGTGCCG 


TTGAGGTCGC 


ATGCTATGTT 


TGTAAAATCC 


1140 


TGGATCAAAT 


GCCAQCTACT 


CCCAGCTCTC 


CCATGTATGT 


CGATTGATAT 


CGTGCTACAT 


1200 


CAGACTCTA6 


AAAAAAGGGC 


TGAGAGGAAG 


CAAGACGTAA 


AGAATTTTCA 


TCCCGTATCA 


1260 


CAGTGTTTTT 


ATTGCTCGCC 


CAGACACCAT 


GTGCAATAAG 


ATTGGTGTTC 


GTTTCCATCA 


1320 


TGTCTGTATA 


CTCCTGTCAC 


CTAGAACGTG 


CATCCTTGTA 


ATACCTGATT 


GATCACACAG 


1380 


TGTTAGTGCT 


GGTCAGAGAG 


ACCTCATCCT 


GCTCTTTTGT 


GATGAACATA 


TTCATGAAAT 


1440 


GTGGAAGTCA 


GTACGATCAA 


GTTGTTGACT 


GTGATTAGAT 


CACATCTTAA 


ATTCATTTCT 


1500 


AGACTCAAAA 


CCTGGAGATG 


CAGCTACTGG 


AATGGTGTTT 


TGTCAGACTT 


CCAAATCCTG 


1560 


GAAGGACACA 


GTGATGAATG 


TACTATATCT 


GAACATAGAA 


ACTCGGGCTT 


GAGTGAGAAG 


1620 


AGCTTGCACA 


GCCAACGAGA 


CACATTGCCT 


TCTGGAGCTG 


GGAGACAAAG 


GAGG AATTTA 


1680 


CTTTCTTCAC 


CAAGTGCAAT 


AGATTACTGA 


TGTGATATTC 


TGTTGCTTTA 


CAGTTACAGT 


1740 


TGATGTTTGG 


GGATCGATGT 


GCTCAGCCAA 


ATTTCCTGTT 


TGAAATATCA 


TGTTAAATTA 


1800 


GAATGAATTT 


ATCTTTACCA 


AAAACCATGT 


TGCGTTCAAA 


GAGGTGAACA 


TTAAAATATA 


1860 


GAGACAGGAC 


AGAATGTGTT 


CTTTTCTCCT 


CTACCAGTCC 


TATTTTTCAA 


TGGGAAGACT 


192CFV 


CAGGAGTCTG 


CCACTTGTCA 


AAGAAGGTGC 


TGATCCTAAG 


AATTTTTCAT 


TCTCAGAATT 


1980 


CGGTGTGCTG 


CCAACTTGAT 


GTTCCACCTG 


CCACAAACCA 


CCAGGACTGA 


AAGAAGAAAA 


2040 


CAGTACAGAA 


GOCAAAGTTT 


ACAGATGTTT 


TTAATTCTAG 


TATTTTATCT 


GGAACAACTT 


2100 


GTAGCAGCTA 


TATATTTCCC 


CTTGGTCCCA 


AGCCTGATAC 


TTTAGCCATC 


ATAACTCACT 


2160 


AACAGGGAGA 


AGTAGCTAGT 


AGCAATGTGC 


CTTGATTGAT 


TAGATAAAGA 


TTTCTAGTAG 


2220 
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GGAGCAAAAG 


ACCAAATCTC 


AGTTGTTTGC 


TTCTTGCCAT 


CACTGGTCCA 


GGTCTTCAGT 


2280 


TTCCGAATCT 


CTTTCCCTTC 


CCCTGTGGTC 


TATTGTCGCT 


ATGTGACTTG 


CGCTTAATCC 


2340 


AATATTTTGC 


CTTTTTTCTA 


TATCAAAAAA 


CCTTTACAGT 


TAGCAGGGAT 


GTTCCTTACC 


2400 


GAGGATTTTT 


AAGCCCCAAT 


CTCTCATAAT 


CGCTAGTGTT 


TAAAAGGCTA 


AGAATAGTGG 


2460 


GGCCCAACCG 


ATGTGGTAGG 


TGATAAAGAG 


GCATCTTTTC 


TAGAGACACA 


TTGGACCAGA 


2520 


TGAGGATCCG 


AAACGGCAGC 


CTTTACGTTC 


ATCACCTGCT 


AGAACCTCTC 


GTAGTCCATC 


2580 


ACCATTTCTT 


GGCATTGGAA 


TTCTACTGGA 


AAAAAATACA 


AAAAGCAAAA 


CAAAACCCTC 


2640 


AGCACTGTTA 


CAAGAGGCCA 


TTTAAGTATC 


TTGTGCTTCT 


TCACTTACCC 


ATTAGCCAGG 


2700 


TTCTCATTAG 


GTTTTGCTTG 


GGCCTCCCTG 


GCACTGAACC 


TTAGGCTTTG 


TATGACAGTG 


2760 


AA6CAGCACT 


GTGAGTGGTT 


CAAGCACACT 


GGAATATAAA 


ACAGTCATGG 


CCTGAGATGC 


2820 


AGGTGATGCC 


ATTACAGAAC 


CAAATCGTGG 


CACGTATTGC 


TGTGTCTCCT 


CTCAGAGTGA 


2880 


CAGTCAXAAA 


TACTGTCAAA 


CAATAAAGGG 


AGAATGGTGC 


TGTTTAAAGT 


CACATCCCTG 


2940 


TAAATTGCAG 


AATTCAAAAG 


TGATTATCTC 


TTTGATCTAC 


TTGCCTCATT 


TCCCTATCTT 


3000 


CTCCCCCACG GTATQCTAAA CTTTAGACTT CCCACTGTTC 


TGAAAGGAGA 


CATTGCTCTA 


3060 


TGTCTGCCTT 


CGACCACAGC 


AAGCCATCAT 


CCTCCATTGC 


TCCCGGGGAC 


TCAAGAGGAA 


3120 


TCTGTTTCTC 


TGCTGTCAAC 


TTCCCATCTG 


GCTCAGCATA 


GGGTCACTTT 


GCCATTATGC 


3180 


AAATGGAGAT 


AAAAGCAATT 


CTGGCTGTCC 


AGGAGCTAAT 


CTGACCGTTC 


TATTGTGTGG 


3240 


ATGACCACAT 


AAGAAGGCAA 


TTTTAGTGTA 


TTAATCATAG 


ATTATTATAA 


ACTATAAACT 


3300 


TAAGGGCAAG 


GAGTTTATTA 


CAATGTATCT 


TTATTAAAAC 


AAAAGGGTGT 


ATAGTGTTCA 


3360 


CAAACTGTGA 


AAATAGTGTA 


AGAACTGTAC 


ATTGTGAGCT 


CTGGTTATTT 


TTCTCTTGTA 


3420 


CCATAGAAAA 


ATGTATAAAA 


ATTATCAAAA 


AGCTAATGTG 


CAGGGATATT 


GCCTTATTTG 


3480 


TCTGTAAAAA 


ATGGAGCTCA 


GTAACATAAC 


TGCTTCTTGG 


AGCTTTGGAA 


TATTTTATCC 


3540 


TGTATTCTTG 


TTT 










3553 


(2) INFORMATION FOR SEQ ID NO: 8: 











(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 393 amino acida 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Ala Pro Ser Pro Ser Gly Gly Gly Gly Ser Gly Thr Pro Gly 
15 10 15 

Pro Val Gly Ser Pro Ala Pro Gly His Pro Ala Val Ser Ser Met Gin 
20 25 30 
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Gly Lys Arg Lys Ala Leu Lys Leu Asn Phe Ala Asn Pro Pro Phe Lys 
35 40 45 

Ser Thr Ala Arg Phe Thr Leu Asn Pro Asn Pro Thr Gly Val Gin Asn 
50 55 60 

Pro His lie Glu Arg Leu Arg Thr His Ser lie Glu Ser Ser Gly Lys 
65 70 75 80 

Leu Lys lie ser Pro Glu Gin His Trp Asp Phe Thr Ala Glu Asp Leu 
85 90 95 

Lys Asp Leu Gly Glu He Gly Arg Gly Ala Tyr Gly Ser Val Asn Lys 
100 105 110 

Met: Val His Lys Pro Ser Gly Gin He Met: Ala Val Lys Arg He Arg 
115 120 125 

Ser Thr Val Asp Glu Lys Glu Gin Lys Gin Leu Leu Met Asp Leu Asp 
130 135 140 

Val Val Met Arg Ser Ser Asp Cys Pro Tyr He Val Gin Phe Tyr Gly 
145 150 155 160 

Ala Leu Phe Arg Glu Gly Asp Cys Trp He Cys Met Glu Leu Met Ser 

165 170 175 

Thr Ser Phe Asp Lys Phe Tyr Lys Tyr Val Tyr Ser Val Leu Asp Asp 
180 185 190 

Val He Pro Glu Glu He Leu Gly Lys He Thr Leu Ala Thr Val Lys 
195 200 205 

Ala Leu Met His Leu Lys Glu Asn Leu Lys He He His Arg Asp He 
210 215 220 

Lys Pro Ser Asn He Leu Leu Asp Arg Ser Gly Met He Lys Leu Cys 
225 230 235 240 

Asp Phe Gly He Ser Gly Gin Leu Val Asp Ser He Ala Lys Thr Arg 
245 250 255 

Asp Ala Gly Cys Arg Pro Tyr Met Ala Pro Glu Arg He Asp Phe Ser 
260 265 270 

Ala Ser Arg Gin Gly Tyr Asp Val Arg Ser Asp Val Trp ser Leu Gly 
275 280 285 

He Thr Leu Tyr Glu Leu Ala Thr Gly Arg Phe Pro Tyr Pro Lys Trp 
290 295 300 

Asn Ser Val Phe Asp Gin Leu Thr Gin Val Val Lys Gly Asp Pro Pro 
305 310 315 320 

Gin Leu Ser Asn Ser Glu Glu Arg Glu Phe Ser Pro Ser Phe He Asn 
325 330 335 

Phe Val Asn Leu Cys Leu Thr Lys Asp Glu Ser Lys Arg Pro Lys Tyr 
340 345 350 

Lys Glu Leu Leu Lys His Pro Phe He Leu Met Tyr Glu Glu Arg Ala 
355 360 365 
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Val Glu Val Ala Cya Tyr Val Cya Lye lie Leu Aap Gin Met Pro Ala 
370 375 380 

Thr Pro ser Ser Pro Met Tyr Val Aap 
385 390 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3576 baae paira 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : a ingle 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



CTCCCAACAA 


TGGCGGCTCC 


GAGCCCGAGC 


GGCGGCGGCG 


GCTCCGGGGG 


CGGCAGCGGC 


60 


AGCGGCACCC 


CCGGCCCCGT 


AGGGTCCCCG 


GCGCCAGGCC 


ACCCGGCCGT 


CAGCAGCATG 


120 


CAGGGTAAAC 


GCAAAGCACT 


GAAGTTGAAT 


TTTGCAAATC 


CACCTTTCAA 


ATCTACAGCA 


180 


AGGTTTACTC 


TGAATCCCAA 


TCCTACAGGA 


GTTCAAAACC 


CACACATAGA 


GAGACTGAGA 


240 


ACACACAGCA 


TTGAGTCATC 


AGGAAAACTG 


AAGATCTCCC 


CTGAACAACA 


CTGGGATTTC 


300 


ACTGCAGAGG 


acttoaaaga 


CCTTGGAGAA 


ATTGGACGAG 


GAGCTTATGG 


TTCTGTCAAC 


360 


AAAATGGTCC 


ACAAACCAAG 


TGGGCAAATA 


ATGGCAGTTA 


AAAGAATTCG 


GTCAACAGTG 


420 


GATGAAAAAG 


AACAAAAACA 


ACTTCTTATG 


GATTTGGATG 


TAGTAATGCG 


GAGTAGTGAT 


480 


TGCCCATACA 


TTGTTCAGTT 


TTATGGTGCA 


CTCTTCAGAG 


AGGGTGACTG 


TTGGATCTGT 


540 


ATGGAACTCA 


TGTCTACCTC 


GTTTGATAAG 


TTTTACAAAT 


ATGTATATAG 


TGTATTAGAT 


600 


GATGTTATTC 


CAGAAGAAAT 


TTTAGGCAAA 


ATCACTTTAG 


CAACTGTGAA 


AGCACTAAAC 


660 


CACTTAAAAG 


AAAACTTGAA 


AATTATTCAC 


AGAGATATCA 


AACCTTCCAA 


TATTCTTCTG 


720 


GACAGAAGTG 


GAAATATTAA 


GCTCTGTGAC 


TTCGGCATCA 


GTGGACAGCT 


TGTGGACTCT 


780 


ATTGCCAAGA 


CAAGAGATGC 


TGGCTGTAGG 


CCATACATGG 


CACCTGAAAG 


AATAGACCCA 


840 


AGCGCATCAC 


GACAAGGATA 


TGATGTCCGC 


TCTGATGTCT 


GGAGTTTGGG 


GATCACATTG 


900 


TATGAGTTGG 


CCACAGGCCG 


ATTTCCTTAT 


CCAAAGTGGA 


ATAGTGTATT 


TGATCAACTA 


960 


ACACAAGTCG 


TGAAAGGAGA 


TCCTCCGCAG 


CTGAGTAATT 


CTGAGGAAAG 


GGAATTCTCC 


1020 


CCGAGTTTCA 


TCAACTTTGT 


CAACTTGTGC 


CTTACGAAGG 


ATGAATCCAA 


AAGGCCAAAG 


108* 


TATAAAGAGC 


TTCTGAAACA 


TCCCTTTATT 


TTGATGTATG 


AAGAACGTGC 


CGTTGAGGTC 


1140 


GCATGCTATG 


TTTGTAAAAT 


CCTGGATCAA 


ATGCCAGCTA 


CTCCCAGCTC 


TCCCATGTAT 


1200 


GTCGATTGAT 


ATCGCTGCTA 


CATCAGACTC 


TAGAAAAAAG 


GGCTGAGAGG 


AAGCAAGACG 


1260 


TAAAGAATTT 


TCATCCCGTA 


TCACAGTGTT 


TTTATTGCTC 


GCCCAGACAC 


CATGTGCAAT 


1320 


AAG ATTGGTG 


TTCGTTTCCA 


TCATGTCTGT 


ATACTCCTGT 


CACCTAGAAC 


GTGCATCCTT 


1380 
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GTAATACCTG 


ATTGATCACA 


CAGTGTTAGT 


GCTGGTCAGA 


GAGACCTCAT 


CCTGCTCTTT 


1440 


TGTGATGAAC 


ATATTCATGA 


AATGTGGAAG 


TCAGTACGAT 


CAAGTTGTTG 


ACTGTGATTA 


1500 


GATCACATCT 


TAAATTCATT 


TCTAGACTCA 


AAACCTGGAG 


ATCCAGCTAC 


TGGAATGGTG 


1560 


TTTTGTCAGA 


CTTCCAAATC 


CTGGAAGGAC 


ACAGTGATGA 


ATGTACTATA 


TCTGAACATA 


1620 


GAAACTCGGG 


CTTGAGTGAG 


AAGAGCTTGC 


ACAGCGAACG 


AGACACATTG 


CCTTCTGGAG 


1680 


CTGGGAGACA AAGGAGGAAT 


TTACTTTCTT 


CACCAAGTGC 


AATAGATTAC 


TGATGTGATA 


1740 


TTCTGTTGCT 


TTACAGTTAC 


AGTTGATGTT 


TGGGGATCGA 


TGTGCTCAGC 


CAAATTTCCT 


1800 


OTTTOAAATA 


TCATGTTAAA 


TTAGAATGAA 


TTTATCTTTA 


CCAAAAACCA 


TGTTGCGTTC 


1860 


AAAGAGGTGA 


ACATTAAAAT 


ATAGAGACAG 


GACAGAATGT 


GTTCTTTTCT 


CCTCTACCAG 


1920 


TCCTATTTTT 


CAATGGGAAG 


ACTCAGGAGT 


CTGCCACTTG 


TCAAAGAAGG 


TGCTGATCCT 


1980 


AAGAATTTTT 


CATTCTCAGA 


ATTCGGTGTG 


CTGCCAACTT 


GATGTTCCAC 


CTGCCACAAA 


2040 


CCACCAGGAC 


TGAAAGAAGA 


AAACAGTACA 


GAAGGCAAAG 


TTTACAGATG 


TTTTTAATTC 


2100 


TAOTATTTTA 


TCTGGAACAA 


CTTGTAGCAG 


CTATATATTT 


CCCCTTGGTC 


CCAAGCCTGA 


2160 


TACTTTAGCC ATCAfAACTC 


ACTAACAGGG 


AGAAGTAGCT 


AGTAGCAATG 


TGCCTTGATT 


2220 


GATTA6ATAA 


AGAXTTCTAG 


TAGGCAGCAA 


AAGACCAAA? 


CTCAGTTGTT 


TGCTTCTTGC 


2280 


CATCACTGGT 


CCAGGTCTTC 


AGTTTCCGAA 


TCTCTTTCCC 


TTCCCCTGTG 


GTCTATTGTC 


2340 


GCTATGTGAC 


TTGCGCTTAA 


TCCAATATTT 


TGCCTTTTTT 


CTATATCAAA 


AAACCTTTAC 


2400 


AGTTAGCAGG 


GATGTTCCTT 


ACCGAGGATT 


TTTAACCCCC 


AATCTCTCAT 


AATCGCTAGT 


2460 


GTTTAAAAGG 


CTAAGAATAG 


TGGGGCCCAA 


CCGATGTGGT 


AGGTGATAAA 


GAGGCATCTT 


2520 


TTCTAGAGAC 


ACATTGGACC 


AGATGAGGAT 


CCGAAACGGC 


AGCCTTTACG 


TTCATCACCT 


2580 


GCTAGAACCT 


CTCGTAGTCC 


ATCACCATTT 


CTTGGCATTG 


GAATTCTACT 


GGAAAAAAAT 


2640 


ACAAAAAGCA 


AAACAAAACC 


CTCAGCACTG 


TTACAAGAGG 


CCATTTAAGT 


ATCTTGTGCT 


2700 


TCTTCACTTA 


CCCATTAGCC 


AGGTTCTCAT 


TAGGTTTTGC 


TTGGGCCTCC 


CTGGCACTGA 


2760 


ACCTTAGGCT 


TTGTATGACA 


GTGAAGCAGC 


ACTGTGAGTG 


GTTCAAGCAC 


ACTGGAATAT 


2820 


AAAACAGTCA 


TGGCCTGAGA 


TGCAGGTGAT 


GCCATTACAG 


AACCAAATCG 


TGGCACGTAT 


2880 


TGCTGTGTCT 


CCTCTCAGAG 


TGACAGTCAT 


AAATACTGTC 


AAACAATAAA 


GGGAGAATGG 


294C£y. 

A 


TGCTGTTTAA AGTCACATCC 


CTGTAAATTG 


CAGAATTCAA 


AAGTGATTAT 


CTCTTTGATC 


3000 


TACTTGCCTC 


ATTTCCCTAT 


CTTCTCCCCC 


ACGGTATCCT 


AAACTTTAGA 


CTTCCCACTG 


3060 


TTCTGAAAGG 


AGACATTGCT 


CTATGTCTGC 


CTTCGACCAC 


AGCAAGCCAX 


CATCCTCCAT 


3120 


TGCTCCCGGG 


GACTCAAGAG 


GAATCTGTTT 


CTCTGCTGTC 


AACTTCCCAT 


CTGGCTCAGC 


3180 


ATAGGGTCAC 


TTTGCCATTA 


TGCAAATGGA 


GATAAAAGCA 


ATTCTGGCTG 


TCCAGGAGCT 


3240 
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AATCTGACCG TTCTATTGTG TGGATGACCA CATAAGAAGG CAATTTTAGT GTATTAATCA 3300 

TAGATTATTA TAAACTATAA ACTTAAGGGC AAGGAGTTTA TTACAATGTA TCTTTATTAA 3360 

AACAAAAGGG TGTATAGTGT TCACAAACTG TGAAAATAGT GTAAGAACTG TACATTGTGA 3420 

GCTCTGGTTA TTTTTCTCTT GTACCATAGA AAAATGTATA AAAATTATCA AAAAGCTAAT 3480 

GTGCAGGGAT ATTGCCTTAT TTGTCTGTAA AAAATGGAGC TCAGTAACAT AACTGCTTCT 3540 

TGGAGCTTTG GAATATTTTA TCCTGTATTC TTGTTT 3576 
(2) INFORMATION FOR SEQ ID NOtlO: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH* 399 amino acids 

(B) TTMi amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY t linear 

(xi) SEQUENCE DESCRIPTIONS SSQ ID NO: 10: 

Mat Ala Ala Pro Ser Pro Ser Gly Gly Gly Gly ser Gly Gly Gly Ser 
1 5 10 15 

Gly Ser Gly Thr Pro Gly Pro Val Gly Ser Pro Ala Pro Gly His Pro 
^.20 25 30 

Ala Val Ser Ser Met Gin Gly Lys Arg Lys Ala Leu Lys Leu Asn Phe 
35 40 45 

Ala Asn Pro Pro Phe Lys Ser Thr Ala Arg Phe Thr Leu Asn Pro Asn 
50 55 60 

Pro Thr Gly Val Gin Asn Pro His lie Glu Arg Leu Arg Thr His Ser 
65 70 75 80 

lie Glu Ser Ser Gly Lys Leu Lys lie Ser Pro Glu Gin His Trp Asp 
85 90 95 

Phe Thr Ala Glu Asp Leu Lys Asp Leu Gly Glu lie Gly Arg Gly Ala 
100 105 110 

Tyr Gly Ser Val Asn Lys Met Val His Lys Pro Ser Gly Gin lie Met 
115 120 125 

Ala Val Lys Arg He Arg Ser Thr Val Asp Glu Lys Glu Gin Lys Gin 
130 135 140 

Leu Leu Met Asp Leu Asp Val Val Met Arg Ser Ser Asp Cy s Pro Tyr 
145 150 155 160 

He Val Gin Phe Tyr Gly Ala Leu Phe Arg Glu Gly Asp Cys Trp He 
165 170 175 

Cys Met Glu Leu Met Ser Thr Ser Phe Asp Lys Phe Tyr Lys Tyr Val 
180 185 190 

Tyr Ser Val Leu Asp Asp Val He Pro Glu Glu He Leu Gly Lys He 
195 200 205 

Thr Leu Ala Thr Val Lys Ala Leu Asn His Leu Lys Glu Aan Leu Lys 
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210 215 220 

He I la Hi» Arg Asp He Lys Pro Ser Asn He Leu Leu Asp Arg Ser 
225 230 235 240 

Gly Asn He Ly» Leu Cys Asp Phe Gly He Ser Gly Gin Leu Val Asp 
245 250 255 

Ser He Ala Lys Thr Arg Asp Ala Gly Cys Arg Pro Tyr Mel: Ala Pro 
260 265 270 

Glu Arg He Asp Pro Ser Ala Ser Arg Gin Gly Tyr Asp Val Arg Ser 
275 280 285 

Asp Val Trp Ser Leu Gly He Thr Leu Tyr Glu Leu Ala Thr Gly Arg 
290 295 300 

Phe Pro Tyr Pro Lys Trp Asn Ser Val Phe Asp Gin Leu Thr Gin Val 
305 310 315 320 

Val Lys Gly Asp Pro Pro Gin Leu Ser Asn Ser Glu Glu Arg Glu Phe 
325 330 335 

Ser Pro Ser Phe He Asn Phe Val Asn Leu Cys Leu Thr Lys Asp Glu 
340 345 350 

Ser Lys Arg. Pro Lys Tyr Lys Glu Leu Leu Lys His Pro Phe He Leu 
355 360 365 

Met Tyr Glu Glu Arg Ala Val Glu Val Ala Cys Tyr Val Cys Lys He 
370 375 380 

Leu Asp Gin Met Pro Ala Thr Pro Ser Ser Pro Met Tyr Val Asp 
385 390 395 

(2) INFORMATION FOR SEQ ID NOtll: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH i 393 amino acids 
(8) TYPEs amino acid 

(C) STRANDEDNESS * not relevant 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Pro Lys Lys Lys Pro Thr Pro He Gin Leu Asn Pro Ala Pro Asp 
1 5 10 15 

Gly Ser Ala Val Asn Gly Thr Ser Ser Ala Glu Thr Asn Leu Glu Ala 
20 25 30 

Leu Gin Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu Gin Gin Arg Lys 
35 40 45 

Arg Leu Glu Ala Phe Leu Thr Gin Lys Gin Lys Val Gly Glu Leu Lys 
50 55 60 

Asp Asp Asp Phe Glu Lys He Ser Glu Leu Gly Ala Gly Asn Gly Gly 
65 70 75 80 

Val Val Phe Lys Val Ser His Lys Pro Ser Gly Leu Val Met Ala Arg 
85 90 95 
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Lys Leu lie His Leu Glu lie Lys Pro Ala lie Arg Asn Gin lie lie 
100 105 110 

Arg Glu Leu Gin Val Leu His Glu Cys Asn Ser Pro Tyr lie Val Gly 
115 120 125 

Phe Tyr Gly Ala Phe Tyr ser Asp Gly Glu He Ser He Cys Met Glu 
130 135 140 

His Met Asp Gly Gly Ser Leu Asp Gin Val Leu Lys Lys Ala Gly Arg 
145 150 155 160 

He Pro Glu Gin lie Leu Gly Lys Val Ser He Ala Val He Lys Gly 
165 170 175 

Leu Thr Tyr Leu Arg Glu Lys His Lys He Met His Arg Asp Val Lys 
180 185 190 

Pro Ser Asn He Leu Val Asn Ser Arg Gly Glu He Lys Leu Cys Asp 
195 200 205 

Phe Gly Val Ser Gly Gin Leu He Asp Ser Met Ala Asn Ser Phe Val 
210 215 220 

Gly Thr Arg Ser Tyr Met Ser Pro Glu Arg Leu Gin Gly Thr His Tyr 
225 230 235 240 

*** 

Ser Val Gin Ser Asp He Trp Ser Met Gly Leu Ser Leu Val Glu Met 
245 250 255 

Ala Val Gly Arg Tyr Pro He Pro Pro Pro Asp Ala Lys Glu Leu Glu 
260 265 270 

Leu Met Phe Gly Cys Gin Val Glu Gly Asp Ala Ala Glu Thr Pro Pro 
275 280 285 

Arg Pro Arg Thr Pro Gly Arg Pro Leu Ser Ser Tyr Gly Met Asp Ser 
290 295 300 

Arg Pro Pro Met Ala He Phe Glu Leu Leu Asp Tyr He Val Asn Glu 
305 310 315 320 

Pro Pro Pro Lys Leu Pro Ser Gly Val Phe Ser Leu Glu Phe Gin Asp 
325 330 335 

Phe Val Asn Lys Cys Leu He Lys Asn Pro Ala Glu Arg Ala Asp Leu 
340 345 350 

Lys Glh Leu Met Val His Ala Phe He Lys Arg ser Asp Ala Glu Glu 
355 360 365 

Val Asp Phe Ala Gly Trp Leu Cys Ser Thr He Gly Leu Asn Gin Pro 
370 375 380 

Ser Thr Pro Thr His Ala Ala Gly Val 
385 390 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION t SEQ ID NOtl2: 

Met Leu Ala Arg Arg Lys Pro Val Leu Pro Ala Leu Thr lie Asn Pro 
1-5 10 15 

Thr lie Ala Glu Gly Pro Ser Pro Thr Ser Glu Gly Ala Ser Glu Ala 
20 25 30 

Aen Leu Val Asp Leu Gin Lye Lye Leu Glu Glu Leu Glu Leu Asp Glu 
35 40 45 

Gin Gin Lye Lys Arg Leu Glu Ala Phe Leu Thr Gin Lys Ala Lys Val 
50 55 60 

Ser Glu Leu Lys Asp Asp Asp Phe Glu Arg lie Ser Glu Leu Gly Ala 
65 70 75 80 

Gly Asn Gly Gly Val Val Thr Lys Val Gin His Arg Pro Ser Gly Leu 
85 90 95 

lie Met Ala Arg Lys Leu lie His Leu Glu lie Lys Pro Ala He Arg 
100 105 110 

Asn Gin lift. He Arg Glu Leu Gin Val Leu His Glu Cys Asn Ser Pro 
115 120 125 

Tyr He Val Gly Phe Tyr Gly Ala Phe Tyr Ser Asp Gly Glu He Ser 
130 135 140 

He Cys Met Glu His Met Asp Gly Gly Ser Leu Asp Gin Val Leu Lys 
145 150 155 160 

Glu Ala Lys Arg He Pro Glu Glu He Leu Gly Lys Val Ser He Ala 
165 170 175 

Val Leu Arg Gly Leu Ala Tyr Leu Arg Glu Lys His Gin He Met His 
180 185 190 

Arg Asp Val Lys Pro Ser Asn He Leu Val Asn Ser Arg Gly Glu He 
195 200 205 

Lys Leu Cys Asp Phe Gly Val Ser Gly Gin Leu He Asp Ser Met Ala 
210 215 220 

Asn Ser Phe Val Gly Thr Arg Ser Tyr Met Ala Pro Glu Arg Leu Gin 
225 230 235 240 

Gly Thr His Tyr Ser Val Gin Ser Asp He Trp Ser Met Gly Leu Ser 
245 250 255 

Leu Val Glu Leu Ala Val Gly Arg Tyr Pro He Pro Pro Pro Asp Ala 
260 265 270 

Lys Glu Leu Glu Ala He Phe Gly Arg Pro Val Val Asp Gly Glu Glu 
275 280 285 

Gly Glu Pro His Ser He Ser Pro Arg Pro Arg Pro Pro Gly Arg Pro 
290 295 300 
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Val Ser Gly His Gly Met Asp Ser Arg Pro Ala Met Ala lie Phe Glu 
305 310 315 320 

Leu Leu Asp Tyr He Val Aan Glu Pro Pro Pro Lys Leu Pro Aon Gly 
325 330 335 

Val Phe Thr Pro Asp Phe Gin Glu Phe Val Asn Lys Cys Leu He Lys 
340 345 350 

Asn Pro Ala Glu Arg Ala Asp Leu Lys Met Leu Thr Aan His Thr Phe 
355 360 365 

He Lys Arg Ser Glu Val Glu Glu Val Asp Phe Ala Gly Trp Leu Cys 
370 375 380 

Lys Thr Leu Arg Leu Asn Gin Pro Gly Thr Pro Thr Arg Thr Ala Val 



(2) INFORMATION FOR SEQ ID NO: 13s 

(1) SEQUENCE CHARACTERISTICS t 

(A) LENGTH* 668 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



Met Glu Asp Lys Phe Ala Asn Leu Ser . Leu His Glu Lys Thr Gly Lys 
1 5 10 15 

Ser Ser lie Gin Leu Asn Glu Gin Thr Gly Ser Asp Asn Gly Ser Ala 
20 25 30 

Val Lys Arg Thr Ser Ser Thr Ser Ser His Tyr Asn Asn He Asn Ala 
35 40 45 

Asp Leu His Ala Arg Val Lys Ala Phe Gin Glu Gin Arg Ala Leu Lys 
50 55 60 

Arg Ser Ala Ser Val Gly Ser Asn Gin Ser Glu Gin Asp Lys Gly Ser 
65 70 75 80 

Ser Gin Ser Pro Lys His He Gin Gin lie Val Asn Lys Pro Leu Pro 
85 90 95 

Pro Leu Pro Val Ala Gly Ser Ser Lys Val Ser Gin Arg Met Ser Ser 
100 105 110 

Gin Val Val Gin Ala Ser Ser Lys Ser Thr Leu Lys Asn Val Leu Asp 
115 120 125 

Asn Gin Glu Thr Gin Asn He Thr Asp Val Asn lie Asn He Asp Thr 
130 135 140 

Thr Lys He Thr Ala Thr Thr Ha Gly Val Asn He Gly Leu Pro Ala 
145 150 155 160 

Thr Asp He Thr Pro Ser Val Ser Asn Thr Ala Ser Ala Thr His Lys 



385 



390 



395 



400 



165 



170 



175 
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Ala Gin Leu Leu Asn Pro Asn Arg Arg Ala Pro Arg Arg Pro Leu Ser 
180 185 190 

Thr Gin His Pro Thr Arg Pro Asn Val Ala Pro Hia Lya Ala Pro Ala 
195 200 205 

I la I la Aan Thr Pro Lya Gin Ser Leu Ser Ala Arg Arg Gly Leu Lya 
210 215 220 

Leu Pro Pro Gly Gly Met Ser Leu Lya Met Pro Thr Lya Thr Ala Gin 
225 230 235 240 

Gin Pro Gin Gin Phe Ala Pro Ser Pro Ser Aan Lya Lya Hia lie Glu 
245 250 255 

Thr Leu Ser Aan Ser Lya Val Val Glu Gly Lya Arg Ser Aan Pro Gly 
260 265 270 

Ser Leu lie Aan Gly Val Gin Ser Thr Ser Thr Ser Ser Ser Thr Glu 
275 280 285 

Gly Pro Hia Aap Thr Val Gly Thr Thr Pro Arg Thr Gly Aan Ser Aan 
290 295 300 

Aan Ser Ser Aan Ser Gly Ser Ser Gly Gly Gly Gly Leu Phe Ala Aan 

305 310 315 320 

Phe Ser Lya Tyr Val Aap lie Lya Ser Gly Ser Leu Aan Phe Ala Gly 
325 330 335 

Lya Leu Ser Leu Ser Ser Lya Gly He Aap Phe Ser Aan Gly Ser Ser 
340 345 350 

Ser Arg He Thr Leu Aap Glu Leu Glu Phe Leu Aap Glu Leu Gly His 
355 360 365 

Gly Aan Tyr Gly Aan Val Ser Lya Val Leu Hia Lya Pro Thr Aan Val 
370 375 380 

He Met Ala Thr Lys Glu Val Arg Leu Glu Leu Aap Glu Ala Lya Phe 
385 390 395 400 

Arg Gin He Leu Met Glu Leu Glu Val Leu Hia Lya Cya Aan Ser Pro 
405 410 415 

Tyr He Val Aap Phe Tyr Gly Ala Phe Phe He Glu Gly Ala Val Tyr 
420 425 430 

Met Cya Met Glu Tyr Met Aap Gly Gly Ser Leu Aap Lya He Tyr Asp 
435 440 445 

Glu Ser Ser Glu He Gly Gly He Aap Glu Pro Gin Leu Ala Phe He 
450 455 460 

Ala Aan Ala Val He Hia Gly Leu Lys Glu Leu Lys Glu Gin His Asn 
465 470 475 480 

He He His Arg Asp Val Lys Pro Thr Asn He Leu Cya Ser Ala Asn 
485 490 495 

Gin Gly Thr Val Lys Leu Cys Asp Phe Gly Val Ser Gly Asn Leu Val 
500 505 510 
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Ala ser Leu Ala Lys Thr Asn I la Gly Cys Gin Ser Tyr Met Ala Pro 
515 520 525 

Glu Arg I la Lys Ser Leu Asn Pro Asp Arg Ala Thr Tyr Thr Val Gin 
530 535 540 

Ser Asp lie Trp Ser Leu Gly Leu Ser lie Leu Glu Met Ala Leu Gly 
545 550 555 560 

Arg Tyr Pro Tyr Pro Pro Glu Thr Tyr Asp Asn lie Phe Ser Gin Leu 
565 570 575 

Ser Ala lie Val Asp Gly Pro Pro Pro Arg Leu Pro Ser Asp Lys Phe 
580 585 590 

Ser Ser Asp Ala Gin Asp Phe Val Ser Leu Cys Leu Gin Lys lie Pro 
595 600 605 

Glu Arg Arg Pro Thr Tyr Ala Ala Leu Thr Glu His Pro Trp Leu Val 
610 615 620 

Lys Tyr Arg Asn Gin Asp Val His Met Ser Glu Tyr lie Thr Glu Arg 
625 630 635 640 

Leu Glu Arg Arg Asn Lys lie Leu Arg Glu Arg Gly Glu Asn Gly Leu 
645 650 655 

Ser Lys Asn Val Pro Ala Leu His Met Gly Gly Leu 
660 665 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TTYTAYGGNG CNTTYTTYAT HGA 23 

(2) INFORMATION FOR SEQ ID NO: 15: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: " 
ATBCTYTCNG GNGCCATKTA 20 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16* 
ASTYRYSASA SASASYS 
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